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Abstract 

We present a self-consistent model for explosive financial bubbles, which combines a 
mean-reverting volatility process and a stochastic conditional return which reflects non- 
linear positive feedbacks and continuous updates of the investors' beliefs and sentiments. 
The conditional expected returns exhibit faster-than-exponential acceleration decorated 
by accelerating oscillations, called "log-periodic power law." Tests on residuals show 
a remarkable low rate (0.2%) of false positives when applied to a GARCH benchmark. 
When tested on the S&P500 US index from Jan. 3, 1950 to Nov. 21, 2008, the model 
correctly identifies the bubbles ending in Oct. 1987, in Oct. 1997, in Aug. 1998 and the 
ITC bubble ending on the first quarter of 2000. Different unit-root tests confirm the high 
relevance of the model specification. Our model also provides a diagnostic for the dura- 
tion of bubbles: applied to the period before Oct. 1987 crash, there is clear evidence that 
the bubble started at least 4 years earlier. We confirm the validity and universality of the 
volatility-confined LPPL model on seven other major bubbles that have occurred in the 
World in the last two decades. Using Bayesian inference, we find a very strong statisti- 
cal preference for our niodel co mpared with a standard benchmark, in contradiction with 



Chang and FeigenbaumI [|2006|l which used a unit-root model for residuals. 
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1 Introduction 



We present a self-consistent model for explosive financial bubbles, with nonlinear positive feedbacks 
with mean-reversal residuals. The conditional expected returns exhibit faster-than-exponential ac- 
celeration decorated by accelerating oscillations. An essential advance of our model compared with 
previous specifications such as that of Johansen-Ledoit-Sornette (1999) is to allow for stochastic 
conditional expectations of returns which describe continuous updates of the investors' beliefs and 
sentiments. 

Two different modeling strategies lead to the same final model specification: (i) a rational-expectation 
(RE) model of rational bubbles with combined Wiener and Omstein-Uhlenbeck innovations describ- 
ing the dynamics of rational traders coexisting with noise traders driving the crash hazard rate; or 
(ii) a behavioral specification of the dynamics of the stochastic discount factor describing the overall 
combined decisions of both rational and noise traders. 

Tests on residuals show a remarkable low rate (0.2%) of false positives when applied to a GARCH 
benchmark. When tested on the S&P500 index from Jan. 3, 1950 to Nov. 21, 2008, the model 
correctly identifies the bubbles ending in Oct. 1987, in Oct. 1997 and in the summer of 1998 and the 
ITC bubble ending on the first quarter of 2000. Different unit-root tests confirm the high relevance of 
the model specification. Our model also provides a diagnostic for the duration of bubbles: applied to 
the period before Oct. 1987 crash, there is clear evidence that the bubble started at least 4 years earlier. 
Using Bayesian inference, we find a very strong statisti cal preference for our model co mpared with a 
standard benchmark, in contradiction with the result of IChang and FeigenbaumI 11200611 . Our positive 
result stems from the mean-reverting structure of the residuals of the conditional returns modeling 
the bubbles, which is shown to be essential in order to obtain a consistent model. Absent in previous 
specifications, this feature constitutes the main advance of this work, leading to the novel positive 
results. The same tests performed on seven major bubbles (Hong Kong 1997, ITC 2000 bubble. 
Oil bubble ending July 2008, the Chinese bubble ending in October 2007 and others) suggest that 
our proposed volatility-confined LPPL model provides a consistent universal description of financial 
bubbles, namely a super-exponential acceleration of price decorated with log-periodic oscillations 
with mean-reverting residuals. 

The present work offers an innovative way to break the stalemate in the ex-an te detection of bub- 
bles, which has been much discussed in the literature. For instance, Gurkaynak 1I2OO81I summarizes 
econometric approaches applied to the detection of financial bubbles, stating that the "econometric 
detection of asset price bubbles cannot be achieved with a satisfactory degree of certainty. For each 
paper that finds evidence of bubbles, there is another one that fits the data equally well without allow- 
ing for a bubble. We are still unable to distinguish bubbles from time-varying or regime-switching 
fundamentals, while many small sample econometrics problems of bubble tests remain unresolved." 

Bubbles are often defined as expone ntially explosive pr ices, which are followed by a sudden col- 
lapse. As summarized for instance by iGurkaynakI [|2008n . the problem with this definition is that 
any exponentially growing price regime, that one would call a bubble, can be also rationalized by 
a fundamental valuation model. This is related to the problem that the fundamental price is not di- 
rectly observable, giving no indisputable anchor to understand how observed prices may deviate from 
fundamental values. This was exemplified during the last Internet bubble culminating in 2000 by 
fundamental pricing models, which incorpora ted real options in the fundamental valuation, basically 
justifying any price. iMauboussin et al.l 11 199911 were among the most vocal proponents of the proposi- 
tion offered close to the peak of the Internet bubble, that better business models, the network effect, 
first-to-scale advantages, and real options effect could account rationally for the high prices of dot- 
com and other New Economy companies. These interesting views expounded in early 1999 were 
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in synchrony with the general positive sentiments of the bull market of 1999 and preceding years. 
They participated in the general optimistic view and added to the strength of the herd. Later, after the 
collapse of the bubble, these explanations seemed less attractive. 

Our model addresses in an innovative way this problem of defining and identifying bubbles. It ex- 
tends in a novel direction a class of processes that have been proposed to incorporate the positive feed- 
back mechanisms that can push prices upward faster-than-exponentially. This faster-than-exponential 
characteristics is one of the main diagnostic that we consider for a bubble. Many financial economists 
recognize that positive feedbacks and in particular herding is a key factor for the growth of bubbles. 
Herding can resul t from a variety o f mechanisms, such as anticipation by rational investors of noise 
traders str ategies ILo ng et al.l. ll990|] . agency costs and monetary incentives given to competi ng fund 
managers jPass et al.i. i2008l ] sometimes leading to the extreme Ponzi schern es nPiniitriadiL 120041 . 
rational imitation in the presence of uncertainty jRoehner and SornetteL |2000|], and social imitation. 
The relevance of social irnitation or "word-of-mouth" effects has a long history (see for instance 
nShilled niOOOn . lHong et al.l [|2005n i for recent evidence). Our approach is to build on previous speci- 
fications that describe faster-than-exponential growth of price (coin ed hereafter "super-exponential") 
llSornette and Andersenl I2002L ISornette. Takavasu. and Zhoull2003ll. 



The Johansen-Ledoit-Sornette (JLS) model flJohansen et al.L 1 199911200011 constitutes a first attempt 
to formulate these ingredients into a traditional asse t pricing rnodel. S tarting f rom the rational expec- 
tation model of bubbles and crashes developed by iBlanchardI lll979ll and by iBlanchard and Watson 
[|l982|l . the JLS model considers the critical properties inherent in the self-organization of complex 
systems. In the JLS model, the financial market is composed of two types of investors: perfectly ra- 
tional investors who have rational expectations and irrational traders who are prone to exhibit herding 
behavior. The dynamics of the price is described by the usual geometric Brownian motion plus a jump 
process controlled by its crash hazard rate. The noise traders drive the crash hazard rate according 
to their collective herding behavior, leading its critical behavior. Due to the no-arbitrage condition, 
this is translated into a price dynamics exhibiting super-exponential acceleration, with possible addi- 
tional so-called "log-periodic" oscillations associated with a hierarchical organi z ation and dynamics 
of noise traders. Using the stochastic discount factor (SDF), ISomette and Zhoul ll2006ll extended the 
JLS model to include inter- temporal parameters and fundamental economic factors. 

In the Johansen-Ledoit-Somette (1999, 2000) model, the logarithmic return is drawn from a nor- 
mal distribution with a time-varying drift. 



Hf , 1 — Hf , 



where 



A - B{t, - UY 



1 + 



C 



cos(u;ln(tc — U)) + 



(1) 



(2) 



This so-called log-periodic power law (LPPL) dynan iics given by Q has b een previously proposed 

in diff e rent forms in various papers (see fo r instance ISo mette e t al.l [Il996ll. Feigenbaum a nd Freund 

1 1996 1. Johansen and Sornettd [ll999l . l200ll] . lFeigenbaumi [,2001.1 . lzhou and Sometta 1,2003a] . IProzdz et al, 
ll2003l] .lsornettel ll2004bl] ). The power law A—B{tc—ti)^ expresses the super-exponential acceleration 
of prices due to positive feedback mechanisms, alluded to above. Indeed, for i? > and < /? < 1, 
the rate of change of iJt. diverges as t ^ t^^. The term proportional to cos(u; \\i{tc — ti)) + (j)) describes 
a corr ection to this sup er-exponential behavior, which has the symmetry of discrete scale invariance 
(DSI) llSomettelll998ll . This formulation Q results from analogies with critical phase transitions (or 
bifurcations) occurring in complex adaptive systems with many interacting agents. The key insight 
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is that spontaneous patterns of organization between investors emerge from repetitive interactions 
at the micro-level, possibly catalyzed by top-down feedbacks provided for instance by the media 
and macro-economic readings, which are translated into observable bubble regimes and crashes. A 
common mathematical signature of such critical behavior is found in the power law singularities 
that accompany the faster-than-exponential growth. The additional accelerat ion oscillations may re- 
sult f rom the existence of a discrete hierarchy of the organization of traders llSornette and Johansenl 
199811 . or from the interplay between the inertia of transfor ming information into decision together 
with nonlinear momentum and price-reversal trading styles lllde and Sornettell2002n . 

Previous tests of the LPPL model ([T]) with ^ and its variants belong to the following three main 
types: 

1 . non-parametric tests of the super-exponential behavior a nd especially of the log-periodic o scil- 
latory structure applied to residuals of prices time series Ehou and Somettel l2002l l2003allbll ; 



2. nonlinear least-square fits of price and log-price tirne series [iJohansen and Somettel 12001 



Somette and Johansenll200lLIZhou and Sornettd . l2008LISomette et al.L 1200911 



3. Bayesian methods applied to the time series of returns llChang and Feigenbaumll2006n . 
Each type has limitations. 

• Non-parametric approaches to the LPPL models have focused essentially on testing the statisti- 
cal significance of the log-periodic component of price residuals in bubble regimes ending with 
crashes. In themselves, they do not provide complete tests of the LPPL model ([T]) with Q and 
its variants. 

• Calibrating direc tly price or log-price time series may pro duce spurious high measures of 
goodness of fits ([Granger and Newboldlll974l |Phillipslll986l) . As a consequence of their non- 
stationarity, the goodness of fit may not reflect the properties of the underlying data generating 
process. Indeed, prices or log-prices are to a good approximation generated by non-stationary 
unit-root processes, obtained from the integration of stationary returns. Such integration me- 
chanically reddens the spectrum, damping the high-frequency component of the time series, 
which may lead to the illusion that the generating process is deterministic. 



This problem has led iFeigenbaumI [1200 111 and IChang and FeigenbaumI [|2006ll to propose tests 
of the LPPL model applied to the return time series. Indeed, the LPPL model ([T]) with Q 
also predict a LPPL structure for the returns. The difficulty with this approach is that direct 
filters of the LPPL patterns from daily returns have been unable until now to detect a signal pre - 
dicted to be one-order-of magnitude sma ller than the background noise (IFeigenbaumI feOOlH ; 
see however lSomette and Johansenl 1200111 for a more positive reinterpretation of Feigenbaum's 
results). The standard financial econometric response to this problem is to work with monthly 
or quarterly time scales, so that the volatility is reduced in relative value compared to the drift, 
approximately by the square root of the number of days in a month or in a quarter. Unfortu- 
nately, this is hardly applicable to the problem of detecting and calibrating financial bubbles 
since the signal we are looking for is by construction transient. Therefore, the luxury of long 
time series spanning many months or quarters is not available. If a bubble expands over 4 years, 
this provides only 48 rnonths and 16 quarters, not sufficient to calibrate econometric models. 
Chang and FeigenbaumI ll2006ll later made the first attempt to employ a Bayesian method which 
is better suited for the analysis of complicated time-series models like the JLS model expressed 
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in terms of returns. Through the comparison of marginal likelihoods, they discovered that, if 
they did not consider crash probabilities, a null hypothesis model without log-periodical struc- 
ture outperforms the JLS model. And if the JLS model was true, they found that parameter 
estimates obtained by curve fitting have small posterior probability. Even though the LPPL hy- 
pothesis might be correct, they concluded that researchers should abandon the class of models 
in which the LPPL structure is revealed through the expected return trajectory. 

These problems can be fundamentally traced back to the fact that the JLS model describes a de- 
terministic time-varying drift decorated by a non- stationary stochastic random walk component. In 
accordance with rational expectation, this predetermined deterministic price path is the unbiased ex- 
pectation of a representative rational agent in the market, while the stochastic component describes the 
estimation errors. The problem is that the stochastic random walk component is a variance-increasing 
process, so that the deterministic trajectory strays farther and farther away from the observable price 
path. This is the reason why direct calibration of prices are inconsistent with the estimation of the 
unbiased expectation of prices. And, as we shall demonstrate below, this is also the reason for the 
lack of power of the Bayesian approaches applied to the return time series. 

In this context, the innovation of our approach is to modify the JLS model by a new specification of 
the resid uals, that makes the process co nsis tent with direct price calibration, thus addressing the issues 



raised by lGranger and Newboldl 11197411 and iPhillipsI [| 1 9 8 611 . In a nutshell, the realized observable price 



path during bubbles is attributed to a deterministic LPPL component, while the estimation errors by 
rational investors is modeled by a mean-reversal Ornstein-Uhlenbeck (0-Uj^process. While keeping 
the structure of the model based on time- varying expectations of future returns, the daily logarithmic 
returns are no longer described by a deterministic drift decorated by a Gaussian-distributed white 
noise. Instead, specifying a mean-reversal noise component, the no-arbitrage condition predicts that 
the expected returns become stochastic, which represents the on-going reassessment by investors of 
the future returns. 

Section 2 presents the new model, which we call the "volatility-confined LPPL model", from two 
different perspectives, a first derivation based on rational expectation and an equivalent demonstra- 
tion using the stochastic discount function. Section 3 presents a first battery of empirical statistical 
tests. Applying direct calibrations of the new LPPL specification to prices generated by GARCH(j9, q) 
processes show that the rate of false positives in terms of the detection of bubble regimes is smaller 
than 0.2%. Using tests on residuals of the price calibration method applied to shrinking windows 
converging to the crash of October 1987, we are able to identify a clear bubble regime starting about 
4 years b efore the crash occurred . Section 4 implements the Bayesian analysis, extending the ap- 



proach of iChang and FeigenbaumI 11200611 to our LPPL specification with 0-U residuals. The results 
show a very strong significance of the LPPL model versus a standard benchmark, as the marginal 
likelihood calculated from the data within bubbles prior to the Oct. 1987 crash is about 150 times 
larger than that of models in which daily returns have no LPPL structure. Section 5 presents the re- 
sults of the tests of section 3 to seven other major bubbles (Hong Kong 1997, ITC 2000 bubble. Oil 
bubble ending July 2008, the Chinese bubble ending in October 2007 and others) to confirm that our 
proposed volatility-confined LPPL model provides a consistent universal description of financial bub- 
bles, namely a super-exponential acceleration of price decorated with log-periodic oscillations with 
mean-reverting residuals. Section 6 concludes. 



'in discrete times, it becomes an AR(1) process 
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2 Volatility-confined LPPL model 



Our volatility-confined LPPL model can be obtained in two ways: (i) using the traditional economic 
framework of rational expectation and (ii) on the basis of the Behavioral Stochastic Discount Factor 
(BSDF). Although both derivations lead to the same specification, they provide different and com- 
plementary economic interpretations. In the following two subsections, we present in turn these two 
derivations. 



2.1 Derivation based on the Rational Expectation (RE) condition 

Let us consider a financial market in which a regime shift occurs, changing from a standard GARCH 
process into a bubble phase. The price dynamics in the bubble regime is assumed to be given by the 
following process. 

— = jj,{t)dt + aydY + awdW — ndj , (3a) 
dY = -aVdt + dW . (3b) 

The symbol / denotes the stock index or price of the asset and W denotes the standard Wiener process. 
The time-varying drift leading to the price acceleration which is characteristic of a bubble regime is 
represented by fx{t) and the jump process j takes the value zero before the crash and one afterwards. 
The constant k, denotes the percentage price drops during a crash. The stochastic process Y plays an 
important role in the model. For < a < 1, F is an Omstein-Uhlenbeck process, so that dY and Y 
are both stationary. As we shall see, this property ensures that the calibration of the LPPL model to 
the price time series is consistent, which was not the case for the standard JLS model in the absence 
of Y. Equation (|3b1 ) describes a self-stabilization mechanism occurring in the market that confines 
the volatility to remain bounded during the bubble gestation all the way until the downward jump 
(or crash) occurs. For « = or in absence of Y, the model recovers the original form of the price 
dynamics in the JLS model. The JLS model is therefore nothing but a special case of our model (|3]) 
with (|3b1) . The corresponding version in discrete time of Q with (|3b1 ) reads 

In It+i - In Jj = + o-y (Ft+i - F*) + aw^t - f^^jt , (4a) 
Yt+i = il-a)Yt + et, (4b) 

whereat ~ A^(0, 1). 

Let us assume that the dynamics of the Stochastic Discount Factor (SDF) satisfies: 

^ = -rdt - pydY - pwdW . (5) 
At 

The factor r quantifies the difference between the risk- free interest rate r f and the dividend growth rate 
S (r = rj — 5). The terms pydY and pwdW amount to transforming the objective drift of the return 
process into its corresponding risk-neutral version, via the no-arbitrage condition © written below. 
The SDF can be interpreted as the excess return over the spot interest that an asset must earn per unit 
of risk variance associated respectively with the two processes Y and W. Only these two stochastic 
processes need to be considered in the dynamics of At since any others which are uncorrelated with Y 
and W do not contribute to the pricing of the assets considered here. The SDF A is the pricing kernel 
of the financial market, that reflects the risk-neutral probability measure in which the current intrinsic 
price of any asset is equal to the value of its expected discount future payoffs. When the market is 
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complete and the no-arbitrage condition holds, the product of the SDF with the value process I{t) of 
any admissible self-financing trading strategy implemented by trading on a financial asset must be a 
martingale process, 

A{t)I{t) = E[K{t')I{t') \^t] >t, (6) 

or rewritten in differential form 

EiJrf(A(t)J(t))] = 0, (7) 

where the expectation operator Et^ [ ■ ] represents the expectation conditional on all current disclosed 
information corresponding to the cr-algebra ■ From condition (|7]), we obtain 

d{AtIt) 1 ^ .dKt dit , dAt dit , 
= {-rdt - pyEt^idY)} + {EtMt))dt + avEt^idY) - Kh{t)dt} - ^ ^pia^dt 

i,j=Y,W 

(8) 

= Et^Xl^{t))dt - rdt - Kh{t)dt - ^ "^piajdi + (ay - pY)Eto{dY) 

i,j=Y,W 

= EtMt))dt - rdt - Kh{t)dt - ^ ^pia^dt + (ay - py)(-ae-"(*-*»)FjJdt . 

i,j=Y,W 

The term ^ ^ piaj is the required excess return remunerating all risks at the exception of the crash 

i,j=Y,W 

risk associated with the jump of amplitude k. We will denote it as pS for short. Then, the above 
equation leads to 

EtMt)) = {r + pS) + Kh{t) + «(cry - py)e-"(*-*°Vi„ . (9) 

The dynamics of the crash hazard rate h{t), given by Et^ [dj] = h{t)dt, plays a very important here, as 
it does in the JLS model. Expression Q includes the expected excess return that needs to remunerate 
rational investors for being exposed to the risk of a crash, which can occur with the hazard rate h(t). 
Here as in the JLS model, we assume that the crash hazard rate h(t) is driven by the behavior of 
"noise traders", who herd into successive phases of euphoria and panics. Assuming a dynamics of 
local imitations and herding on a hierarchical network of social influences as in the JLS model, this 
leads to the crash hazard rate following a LPPL (log-periodic power law) process of the type 

Compared with the JLS model, the new ingredient Y in (|3b1 ) translates into an additional term 
proportional to e~"^*~*°)ytQ in expression Rather then being deterministic as in the JLS model, 
the return Et^{p(t)) that is anticipated at time to for the time horizon up to t is a function of the 
specific stochastic realization Yt^ of the 0-U process Y which is known at to- This property captures 
the possible updates of belief of RE investors. Even though RE assumes that a RE investor always 
makes an unbiased estimation of the actual return, it is rational to account for the fact that his/her belief 
would adjust to the flow of available information, i.e., p(ti) = Et^ {~ir'^ -^*2 (x^) = /^(^s), for 

Since Etg{Yt) = e~"'^*~*°^Ff(, by construction of the 0-U process Yt, the simplest specification for 
the drift term p(t) of the price process ([3]), which is compatible with Q, reads 

p(t) = (r + pE) + Kh{t) + a(ay - py)Ft . (10) 

Substituting (flOl) into ([3]), we obtain 

Y = [r + pS + Kh{t) ]dt - apyYdt + (ay + aw)dW (11) 
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Similarly, substituting (flOl) into dH), we obtain the discrete formulation for the dynamics of the loga- 
rithmic returns: 



In/j+i -\nlt = fit + (Ty(Ft+i - Yt) + aw£t 

= [r + pS + K,h{t)] - apYYt + (try + (Tw)£t (12a) 
= [r + pS + Kh{t)] + py(Ft+i - Ft) + (ay + - py)et (12b) 

As explained below equation Q, following the JLS model, we assume that the crash hazard rate 
h{t) follows a deterministic time-dependence, that describes the collective behavior of noise traders 
approaching a critical time at which the probability per unit time for a crash to occur peaks sharply. 
Using a model of social imitation on a hierarchical network of social influences, JLS obtained a 
crash hazard rate obeying a LPPL process. Since r, p, S and k, are assumed constant, the term 
r + pS + Kh{t) is following a LPPL deterministic process AH(t) = H{t + 1) — H(t), where H{t) 
is given by expression Q. 

Then, using (I12bl) . the residual ut = In /t — H(t) of the logarithm of the asset value with respect 
to the deterministic LPPL process is given by 

vt+i -vt = Py(^t+i - ^t) + {pY + (yw - pY)et ■ (13) 

Operationally, the process Ut is nothing but the residuals of the nonlinear calibration of the process 
H{t) to the asset price time series In It. 

We make the hypothesis that price regimes where bubbles dominate are characterized by a strong 
deterministic component H{t) in the log-price dynamics. As a consequence, one can expect that the 
residuals I't remain bounded, so that the log-price re mains "guided" by H( t ). If H (t) was stochastic. 



we would say that In It and H{t) are cointegrated [[Granger and Hallmanl Il99lll . Translated in the 
context of expression ([T3l ). this implies that we consider the case where ay + aw ~ Pv with |ay + 
c"VF — Py\ ^ Py- In this limit, the residuals Ut are stationary and can be taken proportional to Yt, i.e., 
they follow an AR(1) process. Thus, we assume 

Aut = ut+i - i't = -<y^t + Ut (14) 

where Ut is a Gaussian white noise. From (|12bl) and (fT3l) and using the definition of AH{t), we get 

\nlt+i - In It = AH it) + Aut. (15) 

Combining ([T?] ) and (fT4l) . the recursive formula for the logarithmic asset prices reads 

In It+i = \nlt + AHt - «(ln h - Ht) + Ut . (16) 

Equivalently, the equation for the logarithmic return is 

r,+i = ln4^, - ln4 ~ N{AHt.^^,,t, - a(ln4 - HtJ, al{U+i - U)), AHt^^,,t, = Ht^^, - Ht, . 

(17) 

Compared with the conditional probability distribution given by expression ([T]) valid for the JLS 
model, our model introduces a new stochastic term in the drift. This new term «(ln/i. — Ht,) ensures 
that the log-price fluctuates around while remaining in the neighborhood of the LPPL trajectory Ht. 
This formulation ensures the consistency of modeling the log-price by the deterministic LPPL com- 
ponent as a global observable emergent macroscopic characteristics. We refer to model (flTl) as the 
"volatility-confined LPPL model." Obviously, this modeling strategy leading to the general form ([T71) 
holds for arbitrary deterministic models Ht. 
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2.2 Derivation based on the concept of the Stochastic Discount Factor (SDF) 
with critical behavior 

We now present an alternative derivation of the volatility-confined model (flTl) with a LPPL drift 
trajectory (|2l), from a completely different angle compared with the RE bubble model of the previous 
section. Our alternative derivation describes the dynamics of the impact of herding investors on asset 
prices via a novel specification of the stochastic discount factor. This different approach is motivated 
by several weaknesses of the RE model. 

• The RE model segments rather artificially the respective roles of noise traders on the one hand 
and of RE investors on the other hand. The former are assumed to control the crash hazard rate 
only via their herding behavior, and their impact on price is indirect through the no-arbitrage 
condition representing the actions of RE investors that link the conditional expected return to 
the crash hazard rate. 

• Within the logic of the RE model, notwithstanding the deterministic predictability of the crash 
hazard rate obtained via the corresponding deterministic price component, the RE investors 
cannot on average make profit: the RE investors are remunerated from taking the risk of being 
exposed to a crash. Over all possible scenarios, their expected gain is zero. But RE agents 
endowed with different preferences could in principle arbitrage the risk- neutral agents. The 
homogeneity of the RE agent preferences is therefore a limitation of the model. 

Rather than using the interplay between the noise traders driving the crash hazard rate and the 
risk-adverse rational investors acting as market makers, we attribute the characteristics of the price 
behavior to the internal dynamics of the market sentiment. We propose to capture the critical behavior 
of an asset price resulting from the emergent collective organization of the complex financial system 
by a specification of the stochastic discount factor (SDF). 

The starting point is to recognize the existence of critical dynamics (in the sense of complex 
systems) occurring in financial markets. The critical dynamics reflects the herding behavior of imi- 
tational investors, which leads to increasing correlations between the agents translated into financial 
bubbles. Such behaviors result from imperfective information, the use of heuristics and possible bi- 
ases in the judgements of heterogeneous investors. It is therefore natural to combine insights from the 
field of behaviora l finance and the concepts of criticality developed in the theory of complex systems 



[lSornetteLl2004all . 

From a behavioral finance perspective, we refer to Shefrin, who extended the SDF into a so-called 
Behavioral SDF ( BSDF). The BSDF is s upposed to provide a behaviorally-based synthesis of different 



theories of asset pricing jShefrinl . 1200511 . In this approach, the BSDF can be interpreted as a market 
sentiment factor, which according to Shefrin, is not a scalar but a stochastic process reflecting the 
deviation of subjective beliefs described by a certain representative agent (the market itself) relative 
to objective beliefs and of market's equilibrium time discount factor relative to the situation when all 
investors hold objectively correct beliefs. Expressed with discrete times, the BSDF can be defined as 



ST/ N 7r(xj) 



n(xt) 



n(xj) 5 



t 

R,Il 



(18) 



where the exponent stresses that the BSDF embodies the "sentiment" of the market. The term 
'K{xt) denotes the price of a contract that promises a unit- valued payoff, should event Xt occurs at 
time t. TT{xt) is thus the state price of the basic security associated with the time-event pair {t,Xt). 
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n denotes the objective probability density and Pr is the representative investor's subjective belief 
density distribution, which can be derived by aggregating the heterogeneous investor's subjective 
beliefs given a set of adequate state prices. 7/? denotes the coefficient of relative risk aversion of the 
market, g is the interest rate used to discount future payoffs. The term ■ is the product 

of the deviation of market's subjective beliefs to objective beliefs and of market's equilibrium time 
discount rate relative to the objective discount rate. Therefore, it plays the role of a market sentiment 
factor, which we denote by ^{xt) below. Notice that the remaining terms of equation (fTSi) correspond 
to the traditional SDF, which we still denote by A. This leads to express IsF^ [xt] as the product of 
and A, or in continuous time, as 



A(t)^' =<f(t)A(t), (19) 
<^>(^*) = ■ #^ , AW = [^(^*r^«^^*^ (20) 



with 



Armed with this representation (fT9l) . we propose to capture the market critical behavior through 
the dynamics of the market sentiment factor, which is assumed to be characterized by the following 
jump process 

— - = adt — bdj . (21) 

The coefficient a is assumed to be small, as it describes the amplitude of the deviations of the market's 
equilibrium discount rate from the objective discount rate in "normal" times. In contrast, the term dj 
governs the occurrence of a possible catastrophe of the market sentiment resulting from a critical 
collective amplification of pessimism leading to a run-away. When the market operates close to a 
critical point, increasing crowds of bearish investors gather in their social imitational network to drive 
down the market's sentiment which may, as a result, fall sharply with some probability. For all state Xt 
except the most extreme jump-crash associated with state x'^^, we have Pnixt) < n(x(), i.e., investors 
underestimate the risks. On the other hand, Pji{x^^) > n(x'^'')), which means that the whole market 
becomes over-pessimistic at the time when the extreme event is revealed. We also assume that the 
expectation Et{dj) = h(t)dt of the jump process dj defines the hazard rate h(t). The difference with 
the RE model of subsection 12.11 is that, here, h{t) represents the probability for an overwhelming 
synchronized bear raid to occur, conditional on the fact that the raid has not yet happened. As in 
subsection 12. 11 we assume that h{t) follows a deterministic time-dependence with LPPL properties 
that are typical of a critical behavior on a hierarchical network. Using (fT9l ) and (|2T1) . we have 

^ = ^-^ = — + — + —— = -[r-a]dt-hd3+pydY + p^dW . (22) 

For the price process, we use the same model (01) as in the previous subsection and the same 
process ([5]) for the SDF A(t). The main difference with the RE model of subsection 12. II is that the 
dynamics of the asset price given by (Hj) does not have a jump term. Since we attribute the possible 
occurrence of a crash to a phase transition resulting from a herding behavior, it is in accord with 
intuition that the inherent process of the asset price dynamics does not contain jumps. 

Assuming that the financial market is complete and in absence of risk-free arbitrage, the product 
of the rate of change of asset price and of the BSDF should satisfy the martingale ccondition, i.e., 
Et[d{kl'^ It)] = 0. With (1221) and ©, this leads to 

Y = [r + p'^ - a + bh{t)]dt - apyYdt + (cry + aw)dW . (23) 
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This equation has the same structure as expression (fTTj) obtained with the RE model, with just a 
redefinition of the constants r + pTj^r + pTj — a and k ^ h. With the same price dynamics, the 
conditional probability distribution of returns are identical. It is this model (l23l) or equivalently (fTTI) 
that we will calibrate and test in the next sections. 

But, before doing so, let us interpret the economic meaning of the above derivation based on the 
concept of the BSDF with critical behavior. In contrast with the RE model, there is not need here for 
a representative rational investor playing the role of a market maker fixing the price on the basis of 
his rational expectations. The underlying origin of the martingale condition and the mechanism for 
the crash are quite different from that of the RE model. First, we assume that the financial market 
is complete, i.e., Arrow-Debreu securities (A-Ds) are available to all investors that allow a perfect 
replication of the asset value before the crash. In the early stage of the bubble regime, as the whole 
market is over-optimistic, the probability for a sharp price drop is underestimated and taken to be 
vanishingly small. Hence, the price of A-Ds for a crash state is also zero. In this situation, the 
current price of the asset is the aggregation of the prices for all available A-Ds that correspond to 
all expectable states of price variations in the market. As time goes on, the percentage of bearish 
investors becomes larger and larger, as the deviation of the asset price from its fundamental value 
increases. When the fraction of bearish investors approaches the critical value from below, with 
some non-zero probability, the market sentiment may shift to over-pessimic and, as a consequence, 
trigger a sudden jump. This jump occurs as a result of amplified subjectively perceived probability 
for a crash, embodying the now predominant over-pessimistic bias. Because there are not yet A-Ds 
associated to the states corresponding to very sharply declining prices, nobody is able to hedge this 
extreme risk. Therefore, there is a tension hovering over the market, which is described by the hazard 
rate h{t) = Et{dj)/dt, where dj punctuates the dynamics (|2TI) of the market sentiment factor. The 
existence of the hazard rate leads investors to require higher returns to compensate for their risk^. 
This is implemented by the martingale condition, expressing that there is no opportunity for riskless 
arbitrage. However, when the downward jump happens, all investors suddenly find that the available 
A-Ds that replicate the asset price have become cheap. Then, it is rational for them to short sell their 
stock and buy all the A-Ds. Given the absence of A-Ds for extreme drops of stock price, this then 
leads to an arbitrage opportunity. This results in further price fall, fueled by the positive feedback 
of the strategic allocation used by investors (short the asset and long the A-Ds). The crash is thus 
the result of the cumulative effect of this vicious circle, corresponding to a spontaneous breaking of 
equilibrium llSornettelllOOOll . 



3 Tests based on the Ornstein-Uhlenbeck structure of Residuals 
of the LPPL model 

We now describe a first series of empirical tests performed using model (|23] ) (or equivalently (fTTI)). 
supplemented by the LPPL specification Q. One key feature is the Ornstein-Uhlenbeck (0-U) struc- 
ture of the residuals. This suggests that evaluations of our model of a bubble regime should test both 
for the presence of significant LPPL signatures as well as for the 0-U property of residuals. Ac- 
cording to (fT4l) . this translates into an AR(1) test for the residuals obtained by fitting the asset price 
trajectory using a LPPL process Q. We will therefore use two strategies. The first one developed 

^In this model, the stock price in the bubble regime is risk driven. But quite different from the RE model in which only 
the representative RE investor requires a compensation for his exposition to market risks, here all investors in the market, 
irrespective of whether they are rational or irrational, are collectively requiring higher and higher returns as the bubble 
develops. 
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in this section calibrates the asset price and then tests for the 0-U properties for the residuals. The 
second one, which is implemented in section HI uses the equivalent specification (fTTI) on the asset 
returns to develop a Bayesian inference test. 



3.1 Evaluation of GARCH processes to test for errors of type I (false positive) 



Recall that the purpose of this paper is to test the claim that financial bubbles can be diagnosed from 
their super-exponential price dynamics, possibly decorated by log-periodic accelerating oscillations. 
A first approach is to test whether standard financial processes exhibit such signatures. As an illustra- 
tion, let us consider the GARCH (1,1) model 



In J, -In 4 



2 



= (^0 + "(In It^i - In It~2 - /Uo) + Per 



(24) 



1 5 



(25) 



where the innovation z is distributed according to the Student-n distribution (with n degrees of free- 
dom). Estimating this GARCH(1,1) model on the S&P500 index for the US market from Jan. 3, 1950 
to Nov. 21, 2008 at the daily time scale (such that one unit time increment in (l24l) corresponds to 
one day) yields the following parameters: conditional mean of return /iq = 5.4 x 10^^, conditional 
variance ctq = 5.1 x 10"^ ARCH coefficient a = 0.07, GARCH coefficient p = 0.926 and number 
of degrees of freedom of the student distribution close n = 7. 

Calibrating the LPPL specification Q to a given price trajectory will always provide some output 
for the parameters and the residuals. In order to qualify the LPPL calibration, we impose the following 
restrictions on the parameters 

B>0 
0.1<p< 0.9 
6 < < 13 
|C|<1 

These conditions (l25l) can be regard ed as the " stylize d fe atures of LPPL", which were documented 
in many previous investigations (see Ijohanseni ll2004ll and lJohansen and Sornettd [|2006n for reviews 
documenting these stylized facts). The two first conditions B > and O.l < P < 0.9 ensures a faster- 
than-exponential acceleration of the log-price with a vertical slope at the critical time tc- The condition 
6 < u; < 13 constrains the log-periodic oscillations to be neither too fast (otherwise they would fit 
the random c omponent of the d ata), nor too slow (otherwise they would provide a contribution to 
the trend, see lHuang et al.l [|2000n for the conditions o n the statistical significance o f log-periodicity). 
The last restriction \C\ < 1 in (l25l) was introduced by iBothmer and Meisteii [|2003ll to ensure that the 
hazard rate h(t) remains always positive. For the sake of brevity, we refer to conditions (|25l) as the 
LPPL conditions. We also impose the search of the critical time tc to be no further than one year 
beyond the last data point used in the fit. 

Table [U shows the results obtained by calibrating the LPPL specification ^ to synthetic time 
series generated with the GARCH model (l24l) . with the LPPL conditions (l25l) . and the unit-root tests 
on the residuals. We have performed these tests on two sets of 1000 synthetic GARCH time series: 
(i) samples of random lengths, with lengths uniformly distributed from 750 days to 1500 days and (ii) 
samples of fixed length of 1500 days. The unit-root tests are the Phillips-Perron test and the Dickey- 
Fuller test, which are such that a rejection of the null hypothesis Hq implies that the residuals are 
stationary (and therefore are compatible with the Omstein-Uhlenbeck process posited in our model 
presented in the previous section^. Table[T] shows first a very small rate of false positives, i.e., less 
than 0.2% of the 2000 GARCH-generated time series are found to obey the LPPL conditions, and 
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would thus be diagnosed as being in a bubble regime. Secondly, the unit-root tests show that, for 
most residual time series obtained as the difference between the synthetic GARCH time series and 
their LPPL calibration, one can not reject the null, i.e. the residuals are non- stationary. This confirms 
that our model is not a good fit to synthetic GARCH time series. 

3.2 Tests on the S&P500 US index from Jan. 3, 1950 to Nov. 21, 2008 

We now apply the same procedure as in the previous subsection to the S&P500 index in the US from 
Jan. 3, 1950 to Nov. 21, 2008. But we do not have of course the luxury of a large sample of different 
realizations, as for the synthetically generated GARCH time series. Instead, we generate two sets 
of time windows of 750 successive trading days. The first (respectively second) set is obtained by 
sliding windows of 750 days over the whole duration of our data sets with time increments of 25 days 
(respectively 50 days), referred to as windows of type I and II respectively. The first (second) set has 
563 (262) windows. 

In tabled we can see that, for set I (respectively II), a fraction Plppl = 2.49% (respectively 
2.84%) of the windows obey the LPPL conditions (l25l) . This is more than a factor of ten larger than 
the corresponding fraction for the synthetic GARCH time series. For this fraction of windows which 
obey the LPPL conditions, all of them reject the two unit-root tests for non-stationarity, showing that 
the time windows, that qualify as being in a bubble regime according to our model, also give residuals 
which are stationary, as required from the Omstein-Uhlenbeck specification of the residues of our 
model. In contrast, table [2] shows that, as for table [B the large majority of windows give residuals 
for which the null unit-root hypothesis of non-stationarity cannot be rejected. This means that, for 
most windows that do not obey the LPPL conditions, their residuals are non-stationary, providing two 
reasons for diagnosing these windows as being in a non-bubble regime. This result, together with the 
100% rate of rejection of the null hypothesis for non-stationarity for the subset of windows which 
obey the LPPL conditions, provides a strong support for our model. In contrast, for windows that are 
diagnosed to be in a bubble regime, their residues are automatically stationary, in accordance with 
our model. A crucial additional evidence is provided by table [3] which lists the windows that obey 
the LPPL conditions. We find that all of them correspond to periods preceding well-known crashes. 
This confirms that our method for identifying bubbles exhibits a very low rate of errors of type I (false 
positives). 

Summarizing our results obtained so far, we can state that about 97-97.5% of the time intervals 
of 750 trading days within the period from Jan. 3, 1950 to Nov. 21, 2008 correspond to non-bubble 
regimes, rather well described by a GARCH process. We have been able to characterize LPPL sig- 
natures of bubbles that occupy about 2.5-3% of the whole time interval. These percentages suggest a 
highly selective and efficient detection filter. We test further this selectivity by focusing on the classic 
crash of October 1987, to test how well we can diagnose a bubble regime preceding it. We consider 
shrinking windows with increasing starting dates and fixed last date of Sep. 30 1987. We scan the 
starting dates with a resolution of 5 days and stop with the shortest window of size equal to 750 
trading days. We expect that the LPPL conditions and the rejection of the null unit-test hypothesis 
for the residuals should be observed increasingly as the starting date of the windows moves upward 
towards the crash date. Table |4] shows the results for different starting dates, which confirm remark- 
ably well our expectations. The closer the starting date is to the crash date, the larger is the fraction 
-Plppl of windows that obey the LPPL conditions. Of these, a fraction of -PstationaryRcsi.|LPPL = 100% 
reject the null unit-root tests of non-stationarity. Compared with the overall fraction of 2.5 — 3% of 
windows that pass the LPPL conditions over the whole time interval from Jan. 3, 1950 to Nov. 21, 
2008, this fraction rises drastically from about 20% to 100% for the time windows most influenced 
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by the latest part of the time series closest to the crash. This suggests the existence of a regime shift 
from a GARCH-like process to a LPPL bubble regime as time approaches the Oct. 1987 crash. Note 
also that all 43 windows that pass the LPPL conditions have starting dates around the end of 1983, 
suggesting that the bubble that led to the great Oct. 1987 crash started around the beginning of 1984. 
This results is very interesting in so far that it strengthens the interpretation of crashes as the outcome 
of a long maturation process, and not due to proximal causes of the previous few days or weeks. 

The left panel of Fig. \T\ shows the fit of the logarithm of the S&P500 US index with expression 
© over the time interval from Jan. 3, 1984 (the first trading day in 1984) to Sep. 30, 1987. The 
time series of the residuals of this fit is shown in the upper right panel and its partial autoregression 
correlation function (PACF) is depicted in the lower right panel for lags from to 20 days. All values 
of the PACF with lags larger than 1 fall within two standard deviations, indicating the absence of 
linear dependence. Combined with the result of the Phillips-Perron test on this series of residuals 
shown in Table [51 this suggests that these residuals are both stationary (they reject the unit root test 
of non-stationarity) and furthermore they can be closely approximated by an AR(1) process with a 
mean-reverting coefficient —a ~ —0.03. This supports our proposal to model the residuals z/(t) of 
the LPPL as generated by a Omstein-Uhlenbeck process. 

4 Bayesian inference for our modified LPPL model with Ornstein- 
Uhlenbeck residuals 

We now describe the second series of empirical tests performed using model (1231 ) (or equivalently 
(fTTI) ). supplemented by the LPPL specification Q. While the previous section [3] has used the asset 
price to test for the presence of LPPL conditions and has then tested for the Orstein-Uhlenbeck (0-U) 
properties for the residuals, here we use the other equivalent specification (flTl) on the asset returns to 
develop a Bayesian inference test. 

Our approach parallels that of Chang and Feigenbaum (2006) for the implementation of the 
Bayesian inference. But a fundamental difference is that, while their implementation used the speci- 
fication O, our model ( [17] ) contains the additional term — Q!(lnp( — Ht) stemming from the intrinsic 
guiding mechanism associated with the 0-U model of the residuals decorating the deterministic LPPL 
bubble trajectory. We show below that this new term makes all the difference in establishing the sta- 
tistical significance of LPPL properties of asset returns. 

Equation ([T]) suggests that one might detect directly the LPPL signature in returns by removing 
the effects caused by the intrinsic guiding mechanism associated with the 0-U model of the residuals. 
Defining the random variable = —a{\npt — Ht), we define the adjusted return as 

rff = rt-^t = AHt + ut . (26) 

Recall that AHt results directly from the hazard rate and contains the LPPL signal. The residual ut 
should then be a white noise process. The adjusted returns rf^^ defined by ([26l) for the S&P500 US 
index from Jan. 3, 1984 to Sept. 30, 1987 are shown in Fig.[2l The continuous curve shows AHt, 
where the parameters for the process Ht are obtained by a nonlinear least square fit as in the previous 
section. Unsurprisingly, one can see that the deterministic component is very small compared with 
the typical amplitude of the adjusted returns. Note that t he same relative smallness of the LPPL signal 



viewe d in the return time series has been noted earlier |[Feigenbaum[ . [200lL [Chang and Feigenbaum . 



2006[l . It is not clear how to develop a test that directly test for the existence of a significant LPPL 



component in the time series of adjusted returns shown in Fig. [2l 
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The general weakness of the likelihood analysis of log -periodicity on retu rns is not a surprise 
when viewed from the perspective o ffered by the analys is of iHuang et al.l KOOOll . Using numerically 
intensive Monte- Carlo simulations. iHuang et alj [|2000ll showed that, for regularly sampled time se- 
ries as is the case for financial time series, the log-periodic signal is much more significant in the 
cumulative signal than in its first difference (and that using the cumulative signal does not create 
spurious log-periodicity), due to the well-known fact that integration corresponds to low-pass filter- 
ing. This suggests that working on returns, while being the stan dard of econometric studies, may 
actually be sub-optimal in this case. ISornette and JohansenI 11200 ill summarized in their section 9 the 
Monte-Ca rlo tests which have been performed b y various groups to address specifically this problem, 
including jpeigenbaum and Freundl . ll996Lll998|] . both on synthetically generated price levels and on 
randomly chosen time intervals of real financial time series: these tests show the high statistical sig- 
nificance of logperiodicity in the log-price trajectory before the crash of October 1987 and on several 
other bubbles. 

We thereupon turn to the method of Bayesian inference to investigate the statistical significance 
of LPPL features in the return time series. Following the philosophy attached to Bayesian analysis, 
two models can be compared by estimating the ratio of the posterior probability for each model given 
the data, this ratio being called the Bayesian factor. Let Mq denotes the benchmark model and Sq 
its corresponding set of parameters. Similarly, let Mi denotes an alternative model with its set of 
parameters Hi. Then, the Bayesian factor of model Mi compared with model Mq is defined as 
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In this expression, 9m denotes the vector of parameters for model M. The term p( S | Q ; M) 
represents the posterior probability for the set of parameters in model M, given the observed data 
Q. The term p{9m', M) is the prior probability chosen for the parameters 9 in model M. Within the 
framework of Bayesian hypothesis testing, if Bmi,Mo is larger than 1, one should accept the alternative 
model because the posterior probability for its parameters has enjoyed a larger increase from its initial 
prior basis level, which implies that the alternative model can explain the data better than the reference 
model. If the prior probabilities are not too restrictive, and for a large sufficient data set, Bayesian 
inference amounts to comparing the likelihood function of each model and the Bayesian factor test 
tends asymptotically for large data sets to the likelihood ratio test. 

Let us consider the time ser ies of returns sampl e d at th e time instants t G {to, ^i, ^2, ■ " " ? 
For the reference model, as in IChang and FeigenbaumI 11200611 . we choose the Black-Scholes model 
whose logarithmic returns are given by 



N{fL{U-ti^l),(T\ti-ti^l)) 



(28) 



The drift /i is drawn from the prior distribution A^(/ir , cr^) . The variance cr^ of daily returns is specified 
in terms of its inverse r = known as the "precision" in the language of Bayesian analysis. The 
precision describes how precisely the random variable will be known and thus the higher the better. 
The precision is supposed to be drawn as r ~ T{ar, Pt)- 

The alternate hypothesis model is our volatility-confined LPPL model. Recalling expression (flTl) 
with our present notations, the returns of the alternative model are described by 
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where 



^Hi^i_i = B{tc — U^i] 
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The LPPL characteristics of the model for Aif, j_i are encoded in the the vector of parameters ^ = 
{A, B,C, P,LU,(j), tc). For simplicity, we assume that these parameters are drawn independently from 
the following prior distributions: 

A ~ N{fiA,aA) 

B ~ r{aB,l3B) 
C ~ t/(0,l) 

(3 ^ B{ap ,(3f,) (30) 
(p ~ f/(0,27r) 

where F, B and U denote the .T-distribution, ^-distribution and uniform distribution respectively. 
In practice of bayesian inference, the .T-distribution and ^-distribution are often adopted as prior 
probability distribution!^. The .T-distribution is usually used to describe non-negative variable, and 

has the density function is /(x; a, /5) = /5~"r~^(a) x""^exp , with = a/5 and Var{X) = 

r(z) is the gamma function defined as T{z) = t^~^e~^dt. The random variable realized 
between and 1 is usually assigned with beta prior density, which is f{x] a, (3) 
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py ^, where B(z) is beta function satisfying B(u, v) 

of the variable with ^-distribution are E{X) = and Var{X) — ^,^_^_^_^j^^^^_^^^2 • 
set of parameters of the volatility-confined LPPL model is H = (/i, r, a, ^). The prior density for our 
model is given explicitly by the product of all marginal priors for the each parameter 
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the updated posterior density for qi is 

Qi-l, (^LPPL LPPL) - 
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(32) 



^ ^-distribution and ^-distribution are also called conjugate prior family, because by adopting a prior density of Beta 
(Gamma) form one also obtain a posterior density of Beta (Gamma) form, but with different parameters. Although there 
is no necessity to adopt conjugate prior, the conjugate prior propert y is very convenient for i t avoids having to integrate 
numerically to find the normalising constant in the posterior density I Young and SmithLl2005 l. 
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Thus, the conditional density of the returns given the prior parameters reads 

N 

piQ I 0lppl; LPPL) = l[p{q, I Olppl ] LPPL) , (33) 

i=l 

and the log marginal likelihood needed for the computation of the Bayesian factor is given by 

^ = In ^ p{9lppl)p{Q I Olppl ; LPPL)d9LppL^ . (34) 

Expression (l34l) defines nothing but a smoothing of the likelihood function performed with respect to 
some a priori weight for the input parameters. 

Before proceeding to the calculation of expression (|34|) for =^lppl and J^bs and obtain the Bayesian 
factor, we should point out that a major difficulty with the Bayesian inference test lies with the fact 
that the prior distribution is in general unknown to us. This difficulty cannot really be alleviated by 
trying different priors and by checking the corresponding posteriors, because all posteriors are false 
as long as we do know the true a priori distribution of the parameters. We stress that there is a highly 
non-trivial assumption underlying the Bayesian inference test, namely that the parameters can be con- 
sidered as random values: random parameters would need in general an ensemble of different sample 
realizations (or series of experiments), whereas we are interested here in one particular realization (or 
sample). In a sense, the Bayes approach to hypothesis testing assumes that some kind of ergodicity 
on a single sample applies and that the sample is of sufficiently large size. But this needs to be tested 
and it is not a trivial task. 

Given this, we nevertheless pursue, if only fo r the goal of comparing with th e negative results of 



the same procedure applied to the JLS model by IChang and FeigenbaumI ll2006ll . To implement the 



Bayesian inference test, we consider the same data set as before, namely the S&P 500 US index , 
but concentrating on the period from Jan. 3, 1984 to Sep. 30, 1987 to correspond with our previous 
analysis. The constant drift fi, the precision r, coefficient B and C, super-exponential P, circular 
fr equency for log-periodic oscilla tion u and phase term (p are assigned with the same priors as those 
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Chang and FeigenbaumI 11200611 . The coefficient A, which is the final expected price at critical time, 
is taken from a normal distribution with E[A] =6 and Var[A] = 0, 05 to roughly accord with the 
extend of price fluctuations near the critical time. Since tc can be a few days or months after the 
real crash, but with the most probable value just being the crash day, we choose E[tc — t^] = 30 
and standard deviation as ^yVar[tc ~tN] = 30. Additionally, we choose E[a] = ^yVar[a] = 0.05, 
which roughly reflects our estimated results obtained from the test using shrinking windows with a 
fixed last date of Sep. 30 1987 and with time increments of 5 days. The following gives the priors: 
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The integrals in (l34l) for the log marginal likelihood have been estimated by the Monte-Carlo 
method with 10000 sampling values for each integral component. In order to ascertain the validity of 
our numerical estimation of -S^Lppl in (l34l) and to estimate its confidence interval, we have repeated 
these calculation 100 times. We also performed the same calculations for .Sfes and finally get 

=^lppl(2.5% - 97.5%) = 3173.546 - 3176.983 
^Bs(2.5% - 97.5%) = 3169.808 - 3170.097 . 

A difference of the average loglikelihood ^lppl — -^bs of about 5 translates into a very large Bayesian 
factor exp(=^LPPL — =^bs) ~ ~ 150. The Bayesian inference test therefore suggests that our 
volatility-confined LPPL model strongly outpe rforms the Black-Schole s bench mark. 



Our result contrasts decisively with that of iChang and FeigenbaumI [120061] . Using our nurnerica l 
scheme, we were able to reproduce the negative results reported by IChang and Feigenbaum [ 2006 ] 



that the JLS model is not significantly preferred to the benchmark model according to the Bayesian 
inference test. Thus, our new results cannot be ascribed to a spurious numerical implementation 
but reveals the importance of the specification of the residuals. The difference can be traced back 
to the Omstein-Uhlenbeck model of residuals, which make the LPPL fits self-consistent. Given the 
empirical price data, any agnostic economist would have to put more weight on our volatility-confined 
LPPL model than on the standard benchmark without super-exponential growth and log-periodicity. 

In addition, we calculate the log marginal likelihood for the volatility-confined PL (power law) 
model. The PL model is the special case of the volatility-confined LPPL model obtained for C = in 
expression Q. The PL model keeps the super-exponential component but neglects the log-periodic 
oscillatory component. The following compares the log-likelihoods of the two models in their 2.5- 
percentile to 97.5-percentile range obtained over the distribution of their numerical estimations: 

^lppl(2.5% - 97.5%) = 3173.546 - 3176.983 
^pl(2.5% - 97.5%) = 3175.520 - 3178.425 . 

This shows that there is no significant gain in the Bayesian factor when going from the PL model 
to the LPPL model. Actually, the Bayesian factor for the volatility-confined PL model tends to be 
somewhat larger than that of the volatility-confined LPPL model. This should probably be attributed 
to the stronger impact of the priors of the later due to its larger number of parameters, compared with 
the former. 

Indeed, since the Bayes approach suggests to smooth out the likelihoods corresponding to different 
parameter values by an a priori density, it is a legitimate question to ask why such smoothing may 
work. When the sample size n tends to infinity, the maximum-likelihood ML-estimates tend to the true 
values and the likelihood function under the integral in ( |34l ) "cuts out" only a narrow neighborhood 
of the true values. Thus, the behavior of the a priori density outside of this neighborhood becomes 
irrelevant, and the Bayes approach tends to the maximum likelihood approach, of course under the 
condition that the chosen prior would not ascribe zero weight to the true parameter value. However, 
when the sample size is moderate or small and the number of parameters is not small, the situation 
becomes more and more uncertain. The likelihoods can have several (even many) local maxima in 
the present case of log-periodicity. Proponents of the Bayes approach argue that this multiplicity is 
overcome by integration (smoothing). But, for finite sample size n, the smoothing in the marginal 
likelihood may be more harmful (in particular under unfortunate choices of the prior): smoothing and 
its positive effects (suppression or decreasing multiplicity of local peaks) come at the price of a loss of 
efficiency. We believe this could explain the somewhat better performance of the LP model compared 
with the LPPL model within the Bayesian inference tests. 
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In conclusion, we find a decisive preference in favor of the PL and LPPL models against the bench- 
mark model, which supports the claim that the super-exponential property of the price constitutes an 
important characteristics of financial bubbles. 



5 Out-of-sample tests of the volatility-confined LPPL model to 
diagnose other bubbles 

We now apply the above described procedure and tests of significance for the LPPL property to 
different price time series that contain other historical speculative bubbles. Our goal is to test for the 
validity and universality of the volatility-confined LPPL model. 

We proceed in two steps. For each time series to be analyzed, we first calibrate the nonlinear 
model (dl) with ^ to the logarithm of the price. If the LPPL parameters determined from the fit 
for a certain period meet the LPPL conditions (l25l) . a speculative bubble is then diagnosed within 
this period. The volatility-confined LPPL model is then supposed to be applicable. Second, we test 
the 0-U property as well as the order of autoregression of the residuals obtained from the previous 
calibration step in the same time interval. This is a test of the stationarity of the residual time series. 

We consider some of the most important speculative bubbles that have occurred in the World in 
the last decades. Specifically, we study 

• the bubble in the USA as well as in other European markets that led to a crash at the end of the 
summer of 1998 (the so-called Russian crisis), 

• the booming market in Hong Kong in the mid-1990s ending with a crash of October 1997, 

• the ITC bubble reflecting over-optimistic expectation of a new economy ending in the spring of 
2000 with a big crash of the NASDAQ index. 



the so -called oil bubble which started arguably around mid-2003 and ended in July 2008 llSomette et al 



200911 ■ through it marks on the S&P500 index and. 



• the recent Chinese bubble, characterized by crazy ups and downs and a sixfold increase of the 
Chinese indices in just two years, followed by a dramatic drop in a mere half year to one-third 
of its peak value attained in October 2007. We use the Shanghai Stock Exchange Composite 
index (SSEC) and Shenzhen Stock Component index (SZSC), which are two of the major stock 
index in Chinese market. 

Table [6] displays the parameters obtained from the calibration of the LPPL model to these bubbles. 
One can verify that the LPPL conditions B > 0,0.1 < (3 < 0.9, 6 < a; < 13, and \C\ < 1 are met 
for these bubblefl 

Table |7] gives the results of the 0-U test for the residuals obtained from calibrating the nonlinear 
model ([U) with ^ to the logarithm of each time series. Combining the results of the different unit- 
root tests, we conclude that all indices except one have their residuals qualifying as generating by a 
stationary process at the 99.9% confidence level. The exception is the Shenzhen stock component 
index for which the confidence level to reject the null of non-stationarity is 99%. The estimated 
coefficient a of auto-regression associated with the 0-U process is between 0.02 and 0.06. This range 



'^For the SSEC index, the estimated /? is found equal to 0.905, which is barely outside the chosen qualifying interval 
[0.1, 0.9]. Changing slightly by a few days the time window in which the fit is performed puts back the exponent p within 
the qualifying interval. 
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of values corresponds approximately to our choice for the prior distribution of the coefficient a in 
the Bayesian analysis reported in the previous section. The last columns of TablelT] list the order of 
the AR model obtained for the residuals. Two criteria of order selection are tested for robustness. In 
almost all cases, the two different criteria give the same order equal to 1 for the AR model, with only 
one exception being the Hang Seng Index for which the HQ criterion suggests an AR(3). 

The above tests performed on these seven bubbles presented in Tables [6] and |7] suggest that our 
proposed volatility-confined LPPL model, first tested for the bubble and crash of October 1987, is not 
just fitting a single "story" but provide a consistent universal description of financial bubbles, namely 
a super-exponential acceleration of price decorated with log-periodic oscillations with mean-reverting 
residuals. 

6 Concluding remarks 

We have presented a model of bubbles, termed the volatility-confined LPPL model, to describe and 
diagnose situations when excessive public expectations of future price increases cause prices to be 
temporarily elevated. 

To break the stalemate in the literature concerning the detection of bubbles, we have proposed to 
focus on three characteristics: (i) the faster-than-exponential growth of the price of the asset under 
consideration represented by a singular power law behavior, (ii) an accelerated succession of transient 
increases followed by corrections captured by a so-called log-periodic component and (iii) a mean- 
reversing behavior of the residuals developing around the two first components, which by themselves 
form the log-periodic power law (LPPL) model. 

These three properties have been nicely tied together via a rational-expectation (RE) model of 
bubbles with combined Wiener and Ornstein-Uhlenbeck innovations describing the dynamics of ra- 
tional traders coexisting with noise traders driving the crash hazard rate. An alternative model has 
been proposed in terms of a behavioral specification of the dynamics of the stochastic discount factor 
describing the overall combined decisions of both rational and noise traders. 

The test of the volatility-confined LPPL model has proceeded in two steps. First, we calibrated 
the nonlinear model ([T]) with dH) to the logarithm of the price time series under study and diagnosed 
a bubble when the LPPL parameters determined from the fit for a certain period meet the LPPL con- 
ditions (|25l) . Second, we tested for the stationarity of the residual time series. Applied extensively to 
GARCH benchmarks and to eight historical well-known bubbles, we found overall that these bubbles 
obey the conditions for the volatility-confined LPPL model at a very high confidence level (99.9%) 
and that the rate of false positives is very low, at about 0.2%. These results suggest that we have iden- 
tified a consistent universal description of financial bubbles, namely a super-exponential acceleration 
of price decorated with log-periodic oscillations with mean-reverting residuals. 

Further validation will come by testing further on other known bubble cases and in real time. 
These studies are currently underway and will be reported elsewhere. 
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Table 1: Test of the LPPL specifications ^ to synthetic time series generated with the GARCH model 
(|24l) . with the LPPL conditions (|25l) . and the unit-root tests on the residuals. For each type of samples, 
rOOO time series have been generated. 



type of percentage of signif. percentage of not rejecting Hq false positive 



samples 


LPPL condition satisfied 




level 


Phillips-Perron 


Dickey-Fuller 


rate 


random 




a 


= 0.01 


94.1% 


94.1% 


0.2% 


length 


0.2% 


a 


= 0.001 


72.8% 


72.8% 


0.2% 


fixed 




a 


= 0.01 


93.8% 


93.8% 


0.1% 


length 


0.1% 


a 


= 0.001 


72.7% 


72.7% 


0.0% 
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Table 2: Test of the LPPL specifications (O and the unit-root tests on the residuals, for time series of 
750 consecutive trading days of the S&P500 US index in the interval from Jan. 3, 1950 to Nov. 21, 
2008. The first (respectively second) set of windows is obtained by sliding windows of 750 days over 
the whole duration of our data set with time increments of 25 days (respectively 50 days). Plppl 
denotes the fraction of windows that satisfy the LPPL condition. -PstationaryRcsi.|LPPL is the conditional 
probability that, out of the fraction Plppl of windows that satisfy the LPPL condition, the null unit- 
root test for non-stationarity is rejected for the residuals. 



days of number of signif. percentage of not rejecting Hq 

one step windows ^lppl ^g^g^ Phillips-Perron Dickey-Fuller ^stationaryResi.|LPPL 



25 563 2.49% 



50 282 2.84% 



a = 0.01 96.45% 96.45% 100% 

a = 0.001 69.27% 69.27% 100% 

a = 0.01 96.81% 96.81% 100%" 

a = 0.001 70.92% 70.92% 100%" 
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Table 3: Windows of the S&P500 US index in the interval from Jan. 3, 1950 to Nov. 21, 2008 that obey 
the LPPL conditions. Windows of type I (respectively type II) are obtained by sliding a time interval 
of 750 days over the whole duration of our data sets with time increments of 25 days (respectively 50 
days). 



start of window end of window reject Hq for residuals type of sliding step 



May. 7, 1984 


Apr. 24, 1987 


Yes 


I 


Jun. 12, 1984 


Jun. 1, 1987 


Yes 


I&II 


Jun. 18, 1984 


Jul. 7, 1987 


Yes 


I 


Mar. 15, 1991 


Feb. 16, 1994 


Yes 


I&II 


Mar. 25, 1994 


Mar. 13, 1997 


Yes 


I 


May. 3, 1994 


Apr. 18, 1997 


Yes 


I&II 


Jun. 8, 1994 


May. 23, 1997 


Yes 


I 


Jul. 14, 1994 


Jun. 30, 1997 


Yes 


I&II 


Sep. 23, 1994 


Sep. 10, 1997 


Yes 


I&II 


Oct. 28, 1994 


Oct. 15, 1997 


Yes 


I 


Apr. 28, 1995 


Apr. 11, 1998 


Yes 


I&II 


Jun. 5, 1995 


May. 15, 1998 


Yes 


I 


Jun. 11, 1995 


Jun. 21, 1998 


Yes 


I&II 


Sep. 16, 1996 


Sep. 30, 1999 


Yes 


I&II 
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Table 4: Test for the validity of the LPPL conditions and unit-root tests on residuals in windows all 
ending on Sep. 30, 1987 with different starting dates for the S&P500 US index. The smallest window 
size is 750 days. Plppl is the percentage of windows that obey the LPPL conditions in all the test 
windows. -PstationaryRcsi.|LPPL IS the probability that the null unit-root tests for non-stationarity are 
rejected for the residuals, conditional on the fact that the LPPL conditions are met. The unit-root tests 
are also the Phillips-Perron and Dickey-Fuller tests (both produce the same results) with significance 
level of 0.001. 



start of 


number of 


number of series 


-Plppl 


-PstationaryResi. |LPPL 


window 


samples 


satisfy LPPL condition 


Jan. 2, 1980 


242 


43 


17.78% 


100%** 


Jan. 3, 1983 


90 


43 


47.48% 


100%** 


Sep. 1, 1983 


57 


42 


73.68% 


100%** 


Dec. 1, 1983 


44 


43 


97.73% 


100%** 


Mar. 1, 1984 


32 


32 


100% 


100%** 
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Table 5: Phillips-Perron unit root test on residuals of the calibration of the S&P 500 index by the 
LPPL model © with © over the interval from Jan. 3 1984 to Sep. 30 1987. 









Adj. t-Stat 


Prob.* 


Phillips -Perron test statistic 




-4.008 


0.0001 


Test critical values 


0.1% 




-3.588 






1% 




-2.567 






5% 




-1.941 




Model 


Coefficient a 


Std. Error 


s-Statistic 


Prob. 


Ut+i = -avt + Ut 


0.029 


0.0077 


-3.789 


0.0002 


R-squared 


0.015 


Mean dependent var 


-9.95E-05 


Adjusted R-sqaured 


0.015 


S.D. dependent var 


0.0084 


S.E. of regression 


0.0084 




AIC 


-6.7286 


Sum squared resid 


0.0662 




SC 


-6.7234 


Log likelihood 


3186.97 


Durbin- Watson stat 


1.7928 
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Table 6: Parameters obtained from the calibration of the nonlinear model ([U) with (O to the logarithm 
of the different price indices named in the first column. 



index 


^start 


tend 


tc 




UJ 





B 


C 


S&P500 


Jan-03-91 


Apr-30-98 


Jul- 11-98 


0.3795 


6.3787 


4.3364 


0.0833 


0.7820 


FTSEIOO 


Jun-01-94 


May-30-98 


Aug-26-98 


0.4022 


12.1644 


0.9409 


0.0571 


0.8076 


HangSeng 


Jan-03-95 


Jul -3 1-97 


Oct-28-97 


0.7443 


7.4117 


4.9672 


0.0042 


0.7955 


NASDAQ 


Apr-01-97 


Feb-28-00 


May-27-00 


0.1724 


7.3788 


3.2314 


1.0134 


0.9745 


S&P500 


Dec-01-04 


Jul- 15-07 


Oct-26-07 


0.1811 


12.9712 


1.5361 


0.2419 


-0.8884 


SSEC 


Feb-01-06 


Oct-31-07 


Jan-23-08 


0.9050 


7.3538 


2.3614 


0.0054 


-0.6277 


SZSC 


Feb-01-06 


Oct-31-07 


Dec- 14-07 


0.8259 


6.3039 


6.2832 


0.0111 


0.7344 
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Table 7: Stationarity tests on the residuals of the financial indices obtained from the the calibration 
of the nonlinear model ([T]) with (O to the logarithm of the different price indices named in the first 
column. Tripled stars(***) and double stars(**) respectively denote 0.1% and 1% significance levels 
to reject the null Hq that the residual process has a unit root, a is the mean-reverting parameter of 
the Omstein-Uhnlenbeck generating process of the residuals. The orders of the AR model for the 
residuals selected using the Schwarz information Criterion (SIC) and the Hannan-Quinn Criterion are 
listed in the last two columns. 



unit-root test . AR order 

mdex tstart t^nd Phillip s-Pcrrou Dickey-Fuller Coefficients 

S&P500 Jan-03-91 Apr-30-98 -4.454*** -4.594*** 0022 1 T 

FTSEIOO Jun-01-94 May-30-98 -4.731*** -4.893*** 0.045 1 1 

HangSeng Jan-03-95 Jul-31-97 -3.756*** -3.482*** 0.041 1 3 

NASDAQ Apr-01-97 Feb-28-00 -3.849*** -3.759*** 0.037 1 1 

S&P500 Dec-01-04 Jul-15-07 -4.000*** -4.229*** 0.053 1 1 

SSEC Feb-01-06 Oct-31-07 -3.932*** -3.808*** 0.064 1 1 

SZSC Feb-01-06 Oct-31-07 -3.111** -2.960** 0.041 1 1 
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Figure 1: Left panel: fit of the logarithm of the S&P500 US index with expression ^ over the time 
interval from Jan. 3, 1984 (the first trading day in 1984) to Sep. 30, 1987. Upper right panel: time 
series of the residuals of the fit shown in the left panel. Lower right panel: partial autoregression 
correlation function (PACF) of the residuals. The value of the PACF at lag 1 is equal to 0.9709. For 
lags larger than 1, the PACF is bounded between ± two standard deviations. 
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Figure 2: Time series of adjusted returns defined by expression (|26|) for the S&P500 US index from 
Jan. 3, 1984 to Sept. 30, 1987. The smooth continuous line shows the LPPL term AHt, where Ht is 
defined by equation (|2l). 
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